Network Coding Based on Chinese Remainder 

Theorem 



Zhifang Zhang 

Key Laboratory of Mathematics Mechanization, AMSS, Chinese Academy of Sciences 

E-mail: zfz@amss.ac.cn 



(N- 

o: 

(N. 
< 

o: 

(N. 



>: 
vo, 

OV 

en; 

00' 

o: 



X 



Abstract — Random linear network code has to sacrifice part 
of bandwidth to transfer the coding vectors, thus a head of size 
k log |T| is appended to each packet. We present a distributed 
random network coding approach based on the Chinese remain- 
der theorem for general multicast networks. It uses a couple of 
modulus as the head, thus reduces the size of head to 0(log k). 
This makes it more suitable for scenarios where the number of 
source nodes is large and the bandwidth is limited. We estimate 
the multicast rate and show it is satisfactory in performance for 
randomly designed networks. 

Index Terms — Network coding, the Chinese remainder theo- 
rem, multi-source multicast networks 

I. Introduction 

In their pioneering work, Ahlswede et al H) state multicast 
rate can be close to the max-flow bound by allowing network 
coding instead of just routing. Then linear coding schemes for 
general multicast networks are designed [9|, |6| which achieve 
the optimum multicast rate. Inspired by these theoretical 
results, network coding has become a promising technique 
to be applied in networking applications, such as wireless 
networks and content distribution networks. 

In practice, random linear network coding JH is more 
preferable since it is suitable for dynamic networks. But this 
approach has to sacrifice part of bandwidth to keep track of 
the linear combinations chosen currently. Namely, each packet 
needs append a head which is a vector (called the coding 
vector) indicating the combination coefficients associated with 
this packet. The overhead of coding vectors is acceptable for 
large packets, however, in wireless applications, such as sensor 
networks where packets are much shorter and bandwidth is 
very limited, it can very fast become prohibitive. We restate 
an example described in [ 3 1 and ifTTI . 

Example 1. Consider a sensor network consisting of 100 
nodes, each sending a message to a sink. To implement a 
network code over a field of size q = 2 4 , the coding vector is 
in F^ and so is of 50 bytes. But in an usual sensor network, 
such as TinyOs operating system, a typical frame length allows 
approximately 30 bytes for data transmission. Thus just the 
coding vector alone will exceed the bandwidth limit. 

To shorten coding vectors, paper IfTTI proposed a com- 
pression approach by constraining the number of nonzero 
components of each coding vector no larger than a fraction of 
the total dimension. But in practice, to achieve good multicast 
rate, this fraction cannot be too small and it is difficult for 
internal nodes to maintain this constraint. This approach was 



later improved by using erasure decoding and list decoding at 
the cost of increasing decoding complexity at receivers |8|. 

Another linear coding approach is subspace coding [7] 
where messages are mapped into linear subspaces to be 
transferred and thus no coding vectors are needed. But it 
achieves the same information rate as the coding vector based 
approach when the packet length increases (3). Moreover, a 
large codebook must be maintained at the source and sink 
nodes, and designing subspace codes for multi-source network 
coding is very difficult [11]. 

In this paper we propose a distributed random network 
coding approach based on the Chinese remainder theorem 
(CRT) for general multicast networks. Unlike the random 
linear network code, it uses a couple of modulus as the head. 
The existing random linear network coding approaches |4|, 
including the compression approach and its improvements 
ifTTIl . (S), all need coding vectors of size fclogg assuming 
coding over a field ¥ q , while in our coding scheme the 
counterpart is of size O(logfc). 

Before our work, the Chinese remainder theorem has been 
used in network coding J2), but they use CRT-based coding 
only at source nodes, and just routing at internal nodes. We 
use CRT-based coding at each node, therefore it achieves a 
higher multicast rate than just routing. Meanwhile, computa- 
tion performed at each node in our scheme can be simplified 
by pre-computation. 

The paper is organized as follows. Section II introduces the 
coding vector based approach. Section III describes our CRT- 
based coding approach for both single source multicast and 
multi-source multicast. Section IV gives an elementary esti- 
mation of the multicast rate and displays some experimental 
results. 

II. Preliminaries 

A. Coding vector based approach for multicast 

Linear network coding is a widely studied approach in 
the literature. For convenience, let the alphabet F be an l- 
dimensional vector space over a finite field ¥ q , i.e., F = F z 
for some positive integer I, and J 7 be a finite dimensional 
vector space over F. A linear network code can be described 
as follows. Let xi,...,Xfc € F^ be the information sources. 
Each receiver i £ T receives (si, s u ). Since each node 
performs linear operations on its input symbols to generate its 
output symbols, s, is a linear combination of xi, ...,Xfc. That 



is, the receiver t gets the following system of linear equations. 
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The ith row of the coefficient matrix A in (HJ is the coding 
vector corresponding to the received packet s,. 

Obviously, the receiver needs the knowledge of coding 
vectors to solve the equations Q} and recover the source 
information xi,...,x&. In deterministic network coding (6), 
||9), the coding vectors are pre-determined based on the 
network topology. For networks with unknown or changing 
topologies, distributed random coding [4| is a more useful 
approach. That is, each node independently chooses a random 
linear combination of its received packets and generates its 
output packets. Since these linear combinations are randomly 
chosen in a distributed setting, each packet needs append a 
head to record the coding vector associated with this packet. 
Namely, each link e € E transmits 

§e = [ Pe | S e ] , 

where s e G F ' is the information packet and p e G F^ is the 
head such that 

S e = p e (xi, ...,Xfc) T . 

Therefore, in random linear network coding partial bandwidth 
must sacrifice to keep track of the coding vectors. In general, 
it requires q > |T| to devise a linear network coding over W q 
El, @. When k and |T| are quite large while the bandwidth 
is limited, this coding vector based approach turns out to be 
constrained. 

B. The Chinese remainder theorem 

The Chinese remainder theorem (CRT) is a result about 
congruences in number theory. We refer to [10] for the main 
results introduced in this section. 

Theorem 1. (The Chinese Remainder Theorem) 

Let mi,m2,...,m r be pairwise relatively prime positive 
integers. Then the system of congruences 

x = cii (mod rrii), 1 < i < r 

has a unique solution modulo M = m\mi ■ ■ ■ m r . 

For two integers a, b, let gcd(a, b) denote the greatest 
common divisor of a and b, and lcm(a, b) denote the least 
common multiple. Then we consider systems of congruences 
when the modulus are not pairwise relatively prime. 

Theorem 2. The system of congruences 

x = a\ (mod m\) 
x = ci2 (mod m 2 ) 

has a solution if and only if gcd(mi, mi) \ (a\ — 02). If there 
is a solution, it is unique modulo lcm(m\,vnri). 



Corollary 1. The system of congruences 

x = a,i (mod mi), 1 < i < r (2) 

has a solution if and only if gcd(rrii,mf) \ (a^ — af) for all 
pairs of integers (i,j) with 1 < i < j < r. If there is a 
solution, it is unique modulo lcm(nii, ...,m r ). 

It can use the extended Euclidean algorithm to get a solution 
to (ffj in polynomial time. 

III. Network Coding Based on CRT 

A. Single source multicast 

Let s G V be the source node and T C V be the set 
of receivers. Denote |ou£(s)| = k and \in(t)\ — l t for 
t G T. Let the information source be an n-bit integer X, 
i.e., [logX] = n. In the following, we design a network code 
for the single-source multicast problem based on the Chinese 
remainder theorem. 
Coding at the source: 

(s.l) The source s randomly selects 2k distinct rn-bit primes 
Pi, ■■■,P2k- We will discuss the selection of m in Part B. 
(s.2) For 1 < i < k, the ith output link of s transmits 

[X mod P2i-lP2i I P2i-l,P2i] ■ 

Coding at the internal node v G V: 
(v. 1) Suppose the node v gets from all its input links the input 

[ai I qi,q2],—,[ai v | q% v -i,q%X where l v = \in(v)\. 
Then v solves the system of congruences 

x = a t (mod fe-ife): 1 <i <l v 

and gets a solution x = a (mod lcm(qi, g 2 z„)). 
(v.2) v randomly picks qi,qj G {q%, q2i^} and computes 
a' = a (mod qtqj). Then its output links transmit [a' \ 

Qi,qj]- 

Decoding at the receiver node t G T: 

(t.l) Suppose the receiver t gets from its input links the input 

K I qi,<7 2 ],...,K f I q' 2k _ v q' 2k ]. Then t solves the 
system of congruences 

x = a'i (mod q^^i), 1 < i < h 

and gets a solution x = c (mod lcm(q' 1 , q' 2[ )). 

(t.2) Denote lcm(q[, ...,q' 2h ) = N and let < c < N. If 
|~log(c + N)1 > n, then t recovers X = c. Otherwise, t 
concludes that X = c + IN for some integer I. 
The following example illustrates our coding approach on 

the butterfly network. 

Example 2. The source s is to multicast the information X = 
200 which is a 8-bit integer to the receivers t\ and t 2 . It 
chooses primes 3,5,7,11 and transmits the information as 
shown in Figure [7] 

The receiver t\ gets the system of congruences 

x = 2 (mod 3x11) 
x = 46 (mod 7 x 11) 
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Fig. 1. Source s is to multicast X = 200 to t\ and t2 



By using the extended Euclidean algorithm it obtains a solu- 
tion x = 200 {mod 231). Since the information source is at 
most 8-bit in length (which is regarded as a predetermined 
information publicly known to all nodes), t\ can deduce 
X = 200. 

B. Parallelization 

Let (X\, X u ) be the information sources where Xi is 
an n-bit integer. The information is transmitted in the same 
way as described in Section ITlI-AI except that each node deals 
with u systems of congruences simultaneously, each for a 
information source X$. Thus each packet is of the form 

[on, ...,a u | p,q] 

where a; is congruent to Xi modulo pq. Suppose a receiver 
t e T gets inputs [a' y , a' uj \ fty-ijfty], 1 < j < h- It can 
solve the u systems of congruences 

Xi = a! i:j (mod gjy-i^y), 1 < 3 < h, 1 < i < « , 

and get the solutions 

Xi = Ci (mod lcm(gi, ...,q 2 i t )), 1 < % < u . 

In practice, the choice of u depends on the bandwidth and 
size of the modulus, i.e., the bit size m. First, determine m 
according to how many primes are needed in the network code. 
For example, in a sensor network with 30-byte bandwidth 
and 100 source nodes, our CRT-based network code needs 
200 primes. The prime number theorem [5| approximates 
the number of primes no more than n by By this 

approximation, we know the number of 16-bit primes is more 
than 10 3 which is absolutely enough for ordinary sensor 
network. Let m — 16, then each head message is of 4 bytes 
taking 13.3% of the bandwidth. 

Step (t.2) shows there is a possibility that the receiver can 
only get partial information of the message. Actually fixing 
Xi to be an integer of size less than 2m-bit can eliminate 
this possibility. Then adjusting the value of u according to the 
bandwidth and size of the message to be transferred. 

C. Multi-source multicast 

In sensor networks, the information sources are usually gen- 
erated at distributed multiple source nodes and each receiver 
tries to collect the information from all sources. In this section 
we demonstrate how the CRT-based network code works for 
multi-source multicast. 



Suppose there are k information sources X\, ...,Xk gener- 
ated at the distributed source nodes s%, sj. respectively. Let 
T be the set of receivers. Each receiver i£T tries to get the 
information Xi,...,Xk- Certainly, we assume that there is a 
path from source Si to t for 1 < i < k and all these k paths 
are edge disjoint. 

First, each source node determines a pair of m-bit primes 
as its identity, making sure that different sources do not 
have common primes. That is, let p±, ...,p2k be m-bit primes 
different from each other, and (p2i-i>P2i) be s^'s identity for 
1 < i < k. The identity can be easily determined in an initial 
phase. Once they are determined, they are fixed for all the 
transmission thereafter and become the common knowledge 
to all nodes. 

Without loss of generality, we assume Xi is a (2m — 1)- 
bit integer since integers of larger size can be cut in parts and 
transmitted in parallel. Then for 1 < i < k the source Sj sends 
[Xi | p2i-i,P2i] on each of its output links. In this way the k 
sources jointly determine an integer X satisfying the system 
of congruences: 

X = Xi (mod p 2i -iP2i), 1 < i < k . 

From Theorem [2] we know this system is solvable although 
the equations are determined in a distributed setting. 

The internal nodes do the same as in steps (v. 1) and (v.2). 
Then for a receiver t 6 T with l t input links, it solves the 
system of congruences as in step (t. 1) and gets a solution 
x = c (mod lcm(q' 1 , q' 2l )). It is easy to see that X = 
c (mod lcm(^,...,g^ t )) and {q[, ...,q' 2lt } C {pi, ...,p 2k }- 
There are three cases for the receiver t recovering Xf. 

(1) If {p2i-i,P2i} Q Wi, —,Q2i t }' then t can recover X l as 
Xi = c (mod p 2 i~iP2i) ■ 

(2) If {p2i-i,P2i} H {q[, ■■■,<72/ t } = {P2i}, then t computes 
c' = c (mod p2i) and concludes Xi = d + bp2i for 
some integer b. A similar conclusion can be made when 
{p 2l -i,P2 l } r\ {q[, q' 2lt } = {p 2l -i}- 

(3) If {p2i-i,P2i} n {q{, —,q' 2 i t } = 0, then t cannot recover 
X h 

Although there is a chance that t cannot recover Xi or just 
know partial information about Xi, we will show in section 
[IV] that this chance can be very small when l t is large enough. 

D. Simplifying computation 

In steps (v.l) and (v.2) of the CRT-based network coding, 
the internal node with l v input links needs to solve a system of 
l v congruent equations and then pick two primes as its output 
modulus. Actually, this process can be simplified. The internal 
node first picks two primes from all his input primes, say, p 
and q. Let [a, | p,p'] and [oj | q, q'] be two packets it received 
containing the picked primes p and q. Compute 

a = cii (mod p) and b = dj (mod q) . 

Then, it has a system of two congruences: 

X = a (mod p) and X = b (mod q) . (3) 
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It is much easier to solve the system Q containing only 
two congruent equations and get a solution of X modulo pq. 
Similar simplifications can be made at the decoding steps in 
multi-source case. 

IV. Estimation of multicast rate 

For the single source multicast , a receiver can always 
recover the message as stated in Section III-B. For the multi- 
source case, a receiver t finally gets modulus lcm(<^, q' 2 i t ), 
then he can recover X t if {p 2 i-i,P2i} Q {q[, •••,<?2i f }• We 
interested in the number of Xj's that a receiver can recover. 

Lemma 1. Let fc, I be positive integers. Let Si = {2i — 1, 2i} 
for 1 < i < k and S = U^ =1 Si. For 1 < j < I, a subset 
Aj C S with \Aj \ — 2 is independently and uniformly chosen. 
Denote 

Is = {* I 1 < t < fc, Si C \J j=1 Aj} . 
Then the expectation of \Is\ is k{l — (1 — j) 1 ) 2 - 

Denote I = rk. Since lim (1 - ±) fc = ± « 0.3679, we 

approximate (1 — -|) fe by 0.367 as A: is a three-digit number. 
Then the expectation of is fc(l — 0.367 r ) 2 . Define the 
recover rate as R* — \Ig\/k. Thus the expectation of R* is 
about (1 — 0.367 r ) 2 . Since a receiver tries to collect all source 
information, it is reasonable to assume that for any t e T, 
It > fc, i.e., r > 1. Table U lists our estimation of R* at 
some points of r. It can see as r increases the recover rate R* 
becomes more and more satisfactory. 
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3.5 
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R* 


0.40 


0.60 


0.75 


0.84 


0.90 


0.94 


0.96 



TABLE I 



A. An experiment 

Our estimation is based on the probabilistic event described 
in Lemma Q] In the following, we show this estimation does 
not deviate the real performance too much for a randomly 
designed network. 

The network is designed as follows. First, all nodes in 
the network are divided into L + 2 levels, denoted as 
Vq, Vi, Vl, Vl+i C V, and each level contains M,; nodes, 
i.e., |V»| = Mi. Let Vq be the set of sources and Vl+i be the 
set of receivers, and the rest be internal nodes. For < i < L, 
each node in Vi independently links to aMi + i nodes which 
are randomly and uniformly chosen in Vi+i. 

In a experiment, set Mq = 100, Ml+i = 10, er = 0.8, 
and Mi = • • = Ml = M. Then implement the CRT-based 
network code on such a network at different values of M 
and L, and record the number of primes each receiver finally 
gets. The results is displayed in Table [TTJ where tj means the 
number of primes that the ith receiver finally collects and 1Z' = 
10 200 1 ' w hich is close to the real recover rate. Note in the 
experiment r w S^. It can see when r increases, the recover 
rate improves evidently. Meanwhile, each link only takes a 



fraction (for example 13.3% as described in Section III-B) of 
its bandwidth to transfer the head message, which means the 
multicast rate is close to the optimal rate determined by the 
max-flow bound. 
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0.861 
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400,5 
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194 


194 


194 


0.965 


400,3 


187 


195 


193 


195 


194 


195 


192 


192 


190 


195 


0.963 



TABLE II 



Actually it is difficult to achieve the optimal rate for all 
distributed random network coding, since partial bandwidth is 
taken by the head message. The main advantage of our CRT- 
based network coding is the great reduction in head message. 
Suppose there are fc source nodes or the information source is 
of fc dimensions, and there are \T\ receivers. Then the coding 
vector based network code needs to convey the head message 
which is a fc-dimensional vector over ¥ q , where q > |T|. Thus 
the head message is of size fclog|T|. Sometimes the size of 
this head message alone will exceed the bandwidth. While 
our CRT-based network code only needs to convey a pair of 
primes chosen from 2k distinct m-bit primes. By the prime 
number theorem, the primes are of size O(logfc). Therefore, 
comparing with the coding vector based approach, we reduce 
the size of head message from fclog \T\ to O(logfc). 
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